# Multimodal adaptation
Webssl Dino7b Full8b 378
A 7-billion-parameter Vision Transformer trained with self-supervised learning on 8 billion web images, without language supervision, to learn general-purpose visual representations.
Image Classification
Transformers

facebook
68
0
Tiny Random Phi 4 Multimodal
A tiny, randomly initialized model built from an adjusted configuration, intended for debugging and quick pipeline verification.
Image-to-Text
Transformers

katuni4ka
41.78k
0
Aimv2 1b Patch14 224.apple Pt
AIM-v2 is a 1-billion-parameter image encoder available through the timm library, suited to image feature extraction.
Image Classification
Transformers

timm
198
0